
Question 10.5

• Parsimony: The phylogenetic tree is calculated so that the observed diversity can be derived from the precursor sequences (which are not observed, only inferred) with as few changes as possible.

• ML (maximum likelihood): The phylogenetic tree is calculated as the one that most probably gave rise to the observed sequences (individual probabilities for each nucleotide exchange are taken into account). Carry out the calculations, ideally using the same multi-sequence FASTA file for each method.
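The parsimony criterion can be illustrated with a small sketch. It assumes a rooted binary tree written as nested tuples and an already aligned set of sequences, and it uses Fitch's algorithm to count the minimum number of changes per alignment column. The names `fitch_score` and `parsimony_score` and the toy alignment are hypothetical, chosen only for illustration; real programs also search over many tree topologies, which is not shown here.

```python
def fitch_score(tree, leaf_states):
    """Minimum number of state changes for one alignment column.

    tree: nested 2-tuples of leaf names, e.g. (("A", "B"), ("C", "D"))
    leaf_states: dict mapping leaf name -> observed character
    """
    changes = 0

    def state_set(node):
        nonlocal changes
        if isinstance(node, str):      # leaf: the observed residue
            return {leaf_states[node]}
        left, right = (state_set(child) for child in node)
        common = left & right
        if common:                     # children share a state: no change needed
            return common
        changes += 1                   # children disagree: count one change
        return left | right

    state_set(tree)
    return changes


def parsimony_score(tree, alignment):
    """Sum the Fitch score over all columns of the alignment."""
    length = len(next(iter(alignment.values())))
    return sum(
        fitch_score(tree, {name: seq[i] for name, seq in alignment.items()})
        for i in range(length)
    )


# Toy alignment (invented data, not from the exercise).
alignment = {"A": "ACGT", "B": "ACGA", "C": "TCGA", "D": "TCGT"}
tree_1 = (("A", "B"), ("C", "D"))
tree_2 = (("A", "C"), ("B", "D"))
print(parsimony_score(tree_1, alignment), parsimony_score(tree_2, alignment))
```

Under the parsimony criterion, the tree with the lower score is preferred because it explains the same sequences with fewer changes.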

Question 10.6

Use the NCBI download and also the taxonomy option of BLAST. First use a keyword search to find HIV together with the complete polymerase sequence, e.g. HIV1 human; https://www.ncbi.nlm.nih.gov/protein/?term=HIV1+and+human+and+polymerase+complete. This already gives a manageable number of hits. If, however, you simply use HIV, protein and human as search terms, you will be overwhelmed by the number of hits.
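The search address can also be assembled programmatically. The following is a minimal sketch that only builds the URL of the NCBI protein search page from a keyword string and does not contact the server; the function name `protein_search_url` is a hypothetical helper, not part of any NCBI tool.

```python
from urllib.parse import urlencode

NCBI_PROTEIN_SEARCH = "https://www.ncbi.nlm.nih.gov/protein/"


def protein_search_url(keywords):
    """Build the address of an NCBI protein keyword search (no request is sent)."""
    # urlencode replaces spaces with '+', as in the address given in the text.
    return NCBI_PROTEIN_SEARCH + "?" + urlencode({"term": keywords})


url = protein_search_url("HIV1 and human and polymerase complete")
print(url)
```

Adding more specific keywords to the string narrows the search, which is the point made above about unspecific search terms.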

For human, you then find:

>gi|1906384|gb|AAB50259.1| pol polyprotein (NH2-terminus uncertain) [Human immunodeficiency virus 1]
MSLPGRWKPKMIGGIGGFIKVRQYDQILIEICGHKA
IGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETVPVKLKPGMDGPKVKQW
PLTEEKIKALVEICTEMEKEGKISKIGPENPYNTPVFA
IKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAG
LKKKKSVTVLDVGDAYFSVPLDEDFRKYTAFTIPSINNET
PGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDIVIYQ
YMDDLYVGSDLEIGQHRTKIEELRQHLLRWGLTTPDKKHQK
EPPFLWMGYELHPDKWTVQPIVLPEKDSWTVNDIQKLV
GKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLTEEAELELA
ENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPF
KNLKTGKYARMRGAHTNDVKQLTEAVQKITTESIVIWGKT
PKFKLPIQKETWETWWTEYWQATWIPEWEFVNTPPLVKL
WYQLEKEPIVGAETFYVDGAANRETKLGKAGYVTNRGRQ
KVVTLTDTTNQKTELQAIYLALQDSGLEVNIVTDSQYALGI
IQAQPDQSESELVNQIIEQLIKKEKVYLAWVPAHKGIGGNE
QVDKLVSAGIRKVLFLDGIDKAQDEHEKYHSNWRAMASDF
NLPPVVAKEIVASCDKCQLKGEAMHGQVDCSPGIWQLDCT
HLEGKVILVAVHVASGYIEAEVIPAETGQETAYFLLKLAGRW
PVKTIHTDNGSNFTGATVRAACWWAGIKQEFGIPYNPQSQG
VVESMNKELKKIIGQVRDQAEHLKTAVQMAVFIHNFKRKGG
IGGYSAGERIVDIIATDIQTKELQKQITKIQNFRVYYRDSRNPL
WKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIR
DYGKQMAGDDCVASRQDED
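Before a record like this is used for BLAST or an alignment, it may help to normalize it, since copying from a web page or a PDF can introduce stray spaces and line breaks. The sketch below assumes that the header occupies a single line beginning with ">"; the function name `parse_fasta_record` is hypothetical, and the example uses only the first two sequence lines of the record above.

```python
def parse_fasta_record(text):
    """Split one FASTA record into (header, sequence).

    Assumes the header is a single line starting with '>'.
    All whitespace inside the sequence lines is discarded.
    """
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    header = lines[0][1:]
    # Remove spaces within each line, then join the lines into one string.
    sequence = "".join("".join(line.split()) for line in lines[1:])
    return header, sequence


# Beginning of the record above, with letter spacing as it may appear after copying.
example = """>gi|1906384|gb|AAB50259.1| pol polyprotein (NH2-terminus uncertain) [Human immunodeficiency virus 1]
M S L P G R W K P K M I G G I G G F I K V R Q Y D Q I L I E I C G H K A
IGTVLVGPTPVNIIGRNLLTQIGCTLNFPISPIETVPVKLKPGMDGPKVKQW
"""
header, sequence = parse_fasta_record(example)
print(header)
print(sequence)
```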

20  Solutions to the Exercises